Nonuniversal distributions of stock returns in an emerging market 
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There is convincing evidence showing that the probability distributions of stock returns in mature 
markets exhibit power-law tails and both the positive and negative tails conform to the inverse cubic 
law. It supports the possibility that the tail exponents are universal at least for mature markets in 
the sense that they do not depend on stock market, industry sector, and market capitalization. We 
investigate the distributions of one-minute intraday returns of all the A-share stocks traded in the 
Chinese stock market, which is the largest emerging market in the world. We find that the returns 
can be well fitted by the g-Gaussian distribution and the tails have power-law relaxations with the 
exponents fluctuating around a = 3 and being well outside the Levy stable regime for individual 
stocks. We provide statistically significant evidence showing that the exponents logarithmically 
decrease with the turnover rate and increase with the market capitalization, and find that the 
market capitalization has a greater impact on the tail exponent than the turnover rate. Our findings 
indicate that the intraday return distributions are not universal in emerging stock markets. 

PACS numbers: 89.65.Gh, 89. 75. Da, 05.45.Tp 



I. INTRODUCTION 

The logarithmic return of stock price S{t) over a time 
interval Ai is defined as follows, 

r{t)^\iiS{t)-\uS{t- M). (1) 

The form of the distribution of asset price fluctuations 
plays crucial roles in asset pricing and risk management 
[ll-Q. Early empirical and theoretical works, which can 
be traced back to Bachelicr in 1900 [1], argue that as- 
set prices follow geometric Brownian motions, i.e., the 
returns are normally distributed [1, Q • More than half 
a century after Bachelier's work, Mandelbrot finds that 
incomes and speculative price returns follow the Pareto- 
Levy distribution, which has power-law tails 

f{\r\)-W-\ (2) 

whose exponents 1 < a < 2 Q . The picture of Paretian 
markets soon becomes the mainstream Q. 

In recent years, the Boston school shows that the tail 
distributions of returns of many stock indexes and stock 
prices for the USA markets exhibit an inverse cubic law, 
where the power-law exponents are close to a = 3 
A lot of empirical investigations have been conducted on 
financial returns at different time scales Ai in different 
stock markets over different time periods, in which the 
distributions vary from exponential to stretched expo- 
nential to power-law fat-tailed pea. This miscella- 
neous phenomenon is rational since the tails are expected 
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to evolve from power law at small time scale to Gaussian 
at large scale [T^ , according to the variational theory for 
turbulent signals Il9| . which has been confirmed by nu- 
merous studies [9l.ll0l[20|. Note that the stretched expo- 
nential distribution serves as a b ridg e between exponen- 
tial and power- law distributions [20l | . Under certain 
conditions the stretched exponential density approaches 
to a power law, which is also supported by empirical ev- 
idence [3,im. 

For intraday returns, the prevailing point is that the 
distributions have power-law tails outside the Levy sta- 
ble regime. Many efforts have been made to explain this 
power-law behavior. Base on ultra-high-frequency data, 
some researchers argue that large price fluctuations are 
caused by large trading volume [22|, while some others 
submit that news and volume plays a minor role and 
the shortage of liquidity is the main cause [23^ . The 
non-Gaussian behavior of stock prices can also be ex- 
plained by the mixture-of-distribution hypothesis [TtI . [23 | 
or nonextensive statistical mechanics [l4|, [2^ . In addi- 
tion, different classes of microscopic stock market models 
are also able to reproduce the power-law tailed distribu- 
tions [13. 

Despite the variation of tail exponents reported in 
the econophysics literature, there is convincing evidence 
showing that the tail exponents for stocks in western 
mature markets are possibly universal based on a care- 
ful study of the trade-by-trade data of 1000 major USA 
stocks, 85 major stocks traded on the London Stock Ex- 
change which form part of the FTSE 100 index, and 13 
major stocks traded on the Paris Bourse that form part 
of the CAC 40 index . It shows that the exponent 
values are a ~ 3 for the three markets and do not dis- 
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play variations with respect to market capitalization or 
industry sector. 

In this paper, we investigate the distributions of one- 
minute intraday returns of all the A-share stocks traded 
in the Chinese stock market, which is the largest emerg- 
ing market in the world. Our aim is to test the possible 
dependence of the tail exponents on the turnover rate 
and market capitalization. There is statistically signifi- 
cant evidence showing that the exponents increase with 
market capitalization and decrease with turnover rate. It 
indicates that, different from developed markets, the in- 
traday return distributions are not universal in emerging 
stock markets. 



II. DATA SETS 

We employ a nice tick-by-tick database of the A-share 
stocks for all companies traded on the Shenzhen Stock 
Exchange and the Shanghai Stock Exchange from Jan- 
uary 2004 to June 2006. The data were recorded based 
on the market quotes disposed to all traders in every six 
to eight seconds, which are different from the ultrahigh- 
frequcncy data reconstructed from order flows [28| . 

We compute the one-minute intraday returns for each 
stock. We emphasize that the intraday returns are cal- 
culated within individual trading days to eliminate the 
overnight effect [l^, [2^ . For each stock, the one- minute 
returns are normalized so that the mean is and the 
variance is 1. 



III. DEPENDENCE OF RETURN 
DISTRIBUTIONS ON TURNOVER RATE 

We investigate the possible impact of the turnover rate 
on the distribution of stock returns, especially on the 
power-law tail exponent. For each stock, we first calcu- 
late the total traded value of all trades in each minute 



that the prices of individual stocks have a tick size [T^ . 
This phenomenon disappears when we investigate stock 
indices M- 
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FIG. 1. (Color online) Empirical distributions of l-min stock 
returns for the 20 groups partitioned based on the average 
turnover rate. 
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FIG. 2. (Color online) Dependence of the exponent a esti- 
mated from the g- Gaussian function with respect to the aver- 
age turnover rate {v/c}. The solid line gives the logarithmic 
regression a — A + B\n{v/c), where B = —0.09 ± 0.03 and 
= 0.40. 



where n is the number of trades, Vi is the size of the i-th 
trade, and pi is the price of the i-th trade. The l-min 
average traded value (v) is the average over all l-min 
time interval for the stock and the l-min turnover rate is 
calculated by the ratio of average traded values to market 
capitalization. 

We sort all the stocks according to their l-min turnover 
rates and partition them into 20 groups, ensuring that the 
groups contain almost identical number of stocks. The 
l-min returns of the stocks in the same group are pooled 
as one sample. Figure [1] shows the empirical distribu- 
tions of stock returns for the 20 groups. We find that 
all these distributions share a qualitatively similar shape 
with fat tails. A careful scrutiny shows that the curves 
are not smooth around r = 0. This is due to the fact 



For the l-min returns of Chinese stocks, the distribu- 
tions can be well fitted by Student's t-distribution [l^ 



Mr) = Vln^/yB 



1 n 
2' 2 



[a + Lr ) , (4) 



where B{-,-) is the "beta function", L is the scale pa- 
rameter, and a is the degrees of freedom. The Student 
distribution is also known as the q-Gaussian distribution 
in nonextensive statistical mechanics [13, [l^l • We fit the 
20 curves using Eq. (j4|) and plot the estimated exponent 
a as a function of {v /c) in Fig. [2l We adopt a logarithmic 
form 



a 



A + Blii{v/c), 



(5) 
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and a linear regression gives B = —0.09 ± 0.03, where 
the error is determined according to the standard t-test 
at the 5% significance level. There is a decreasing trend 
between a and {v/c). 

The estimates of the tail exponents based on the fitting 
of the g-Gaussian function might be biased since the bulk 
of the distribution with not large returns has dominat- 
ing impact on the objective function of the fitting. Wc 
thus use a well designed method proposed for estimating 
tail exponents, which is known as the CSN method [3]| . 
We briefly review the idea of the CSN method for pos- 
itive returns, which can be easily extended to negative 
returns. The CNS method has a promising advantage 
to determine the cutoff rmin in an objective way. After 
the cutoff rmin is determined, the tail exponent of the re- 
turns r ^ Tmin can be determined based on the maximal 
likelihood estimation Isil. 



N 

iV^ln- 
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where N is the number of returns r that are no less than 



, . In order to determine ; 



one needs to scan differ- 



ent values of r to determine the corresponding parame- 
ters and obtain the Kolmogorov-Smirnov or KS statistics. 
The optimal cutoff rmin is the one that minimizes the KS 
statistic. 
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FIG. 3. (Color online) Dependence of the tail exponents of 
positive and negative returns for the 20 groups as a function of 
the 1-min average turnover rate {v/c). The solid lines are the 
best fitsof a = A + Bln {v/c), where B = -0.15 ±0.04 for the 
positive tails with the p-value equaling to and — 0.45, 
and B = —0.20 ± 0.02 for the negative tails with the p-value 
equaling to and = 0.84, respectively. 



there are two outliers that deviate remarkably from the 
linear trend. If we discard these two points, the results 
for positive and negative returns are very similar. 




FIG. 4. (Color online) Dependence of the tail exponents of 
returns for individual stocks as a function of the 1-min average 
turnover rate {v/c). The solid lines are the best fits of a = 
A + B\n {v/c), where B = -0.23 ± 0.03 for the positive tails 
with the p-value equaling to and R? — 0.07 and B — — 0.24± 
0.03 for the negative tails with the p-value equaling to and 
R? = 0.08, respectively. The dashed line is a = 2. 



We also apply the CSN method to the returns of indi- 
vidual stocks and the exponents of positive and negative 
tails arc calculated. The dependence of the tail expo- 
nents of returns for individual stocks as a function of the 
1-min turnover rate is presented in Fig. 21 Fitting the 
data to Eq. (O, we have B = -0.23 ± 0.03 for the pos- 
itive tails with i?^ = 0.07 and B = -0.24 ± 0.03 for the 
negative tails with i?^ = 0.08, respectively. Although the 
R-squarc values are small, the slopes B are significantly 
different from 0, which implies that the tail exponents do 
depend on the turnover rate. In addition, we find that 
the positive and negative tails are roughly symmetric. 

All the results presented above show that the tails are 
fatter with smaller exponents a for larger turnover rates. 
This is consistent with the conventional wisdom that a 
stock with high turnover rate is riskier and has higher 
volatility. Usually, stocks with small market capitaliza- 
tion have higher turnover rates, which is especially true 
for the Chinese stock market. Therefore, it is possible 
that market capitalization may also have an important 
impact on the heaviness of the return tails. 



Figure |3] shows the dependence of the tail exponents 
of positive and negative returns for the 20 groups as a 
function of the 1-min average trading value. We fit the 
data to the logarithmic function ([5]) and obtain that B = 
-0.15 ± 0.04 for the positive tails with = 0.45 and 
B = -0.20 ± 0.02 for the negative tails with = 0.84, 
respectively. We find that the tail exponents decrease 
with the turnover rate. For the positive return curve. 



IV. DEPENDENCE OF RETURN 
DISTRIBUTION ON MARKET 
CAPITALIZATION 

We now investigate the possible impact of market cap- 
italization on the distribution of stock returns, especially 
on the power-law tail exponent. Wc note that the own- 
ership of stock shares is split and only about one-third of 
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the total outstanding shares are tradable in the market 
during the time period under investigation. The market 
capitalization is calculated as the product of the out- 
standing tradable shares and the price at the beginning 
of the database records. 
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FIG. 5. (Color online) Empirical distributions of 1-min stock 
returns for the 20 groups partitioned based on the average 
market capitalization. 



We sort all the stocks according to their market cap- 
italization and partition them into 20 groups, ensur- 
ing that the groups contain almost identical number of 
stocks. The 1-min returns of the stocks in the same group 
are pooled as one sample. Figure [5] shows the empirical 
distributions of stock returns for the 20 groups with dif- 
ferent average market capitalization. It is found that the 
shape of the curves is very similar to that in Fig. [TJ Com- 
paring Fig. [5] and Fig. [TJ we observe that the curves in 
Fig. [5] are less collapsed with each other. It can be con- 
jectured that the correlation between tail exponent and 
market capitalization is stronger than the turnover rate, 
which is exactly the case as we will show below. 

We estimate the tail exponents using the g-Gaussian 
model and the CSN method as in Sec. lIIIl For each group 
of stocks, we obtain three exponents. Figure |6] illustrates 
the dependence of the tail exponents of 1-min returns 
for the 20 groups as a function of the average market 
capitalization. We observe that all the three exponents 
exhibit an increasing linear trend in semi-logarithmic co- 
ordinates. To fit the three curves, we adopt a logarithmic 
function 



A + B ln(c) 



(7) 



Linear regressions of a with respect to In c give that B = 
0.04 ± 0.02 with R"^ = 0.32 for the q-Gaussian model, 
B 0.15±0.02 with R'^ = 0.80 for the positive tails, and 
B = 0.14 ± 0.02 with R^ = 0.76 for the negative tails, 
respectively. Comparing with the results in Sec. IIV[ the 
R-square values in the present case are much larger and 
the slopes are all significantly different from according 
to the t-tests. In other words, the tails for small-cap 
stocks are fatter than large-cap stocks. This finding is 
consistent with the conventional wisdom in finance that 



small-cap stocks are riskier and have more occurrences of 
large price fiuctuations. 




FIG. 6. (Color online) Dependence of the tail exponents of 
returns for the 20 groups as a function of the average market 
capitalization. The tail exponents are estimated from the q- 
Gaussian model and the CSN method. The three sets of data 
points are fitted to the logarithmic function ([7]). We obtain 
that B = 0.04 ± 0.02 with = 0.32 for the g-Gaussian 
model, B = 0.15 ± 0.02 with R^ = 0.80 for the positive tails, 
and B = 0.14 ± 0.02 with R^ = 0.76 for the negative tails, 
respectively. 
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FIG. 7. (Color online) Dependence of the tail exponents of 
returns for individual stocks as a function of the market cap- 
italization. The solid lines are the best fits oi a = A-\- B\nc, 
where B = 0.22 ± 0.02 for the positive tails with R^ = 0.10 
and B = 0.22 ± 0.03 for the negative tails with R^ = 0.12, 
respectively. 

We also apply the CSN method to the returns of indi- 
vidual stocks and the exponents of positive and negative 
tails are calculated. Figure [7] shows the dependence of 
the tail exponents of returns for individual stocks as a 
function of the average market capitalization. Fitting 
the data to Eq. ([7]) where (c) is replaced by c, we have 
B = 0.22 ± 0.02 for the positive tails with i?^ = 0.10 and 
B = 0.22 ± 0.03 for the negative tails with i?^ = 0.12, 
respectively. We find that the slopes B are significantly 
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different from 0. We also observe that the B values in 
Fig. [7] for individual stocks are larger than those in Fig. [5] 
for grouped stocks. In addition, we find that the positive 
and negative tails are roughly symmetric. 

V. BIVARIATE REGRESSION 

In Sec. mil and Sec. IIVI we have shown that the tail 
exponents arc dependent of the average turnover rates 
and the market capitalization in logarithmic forms. It 
is thus natural to combine these results and suggest a 
bivariate logarithmic function as follows 

a = A + B^^/,, \n{v/c) + B^ In c. (8) 

This test can only be done for individual stocks since 
the two grouping methods in the previous sections are 
different. 

We perform linear least-squares regressions for positive 
and negative tails. The estimated parameters for positive 
tails are obtained that A = -1.13±0.41, B^/c -0.07± 
0.04, and B^ = 0.18 ± 0.03, whose R-square is i?^ = q.II 
and the p-values are 0.006, 0.077, and 0, respectively. 
For negative tails, the parameters are A = —1.23 ± 0.39, 
B^^^ = -0.06 ± 0.03, and B^ = 0.19 ± 0.03, whose R- 
square is i?^ ~ 0.13 and the p-values are 0.001, 0.066, 
and 0, respectively. A comparison of the three regression 
models expressed in Eqs. (O, ([7]) and ^ is presented in 
Table H 

TABLE I. Comparison of the three regression models pre- 
sented in Eqs. ©, (O and ((SJ. The number of stars in the 
superscript of a coefficient indicates its significance level: * * * 
for 1% and * for 10%. 
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Negative tail 
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Be 
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• / 


0.07 


-0.24*** / 


0.08 
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0.22*** 


0.10 


/ 0.22*** 


0.12 


m 


-0.07* 


0.18*** 


0.11 


-0.06* 0.19*** 


0.13 



The bivariate regression results are consistent with 
those in the univariate regressions. That is, the tail ex- 
ponents increase with market capitalization and decrease 
with turnover rate. For both univariate model ([7]) and 
bivariate model ([S]) , the coefficients of market capitaliza- 
tion are significant at the 1% level. The coefficients of 
turnover rate arc significant at the 1% level in the uni- 
variate model (O and significant only at the 10% level in 
the bivariate model In addition, the introducing of 
Inc in model ([Sj improves its explanatory power (char- 
acterized by R^) by 4% to 5%, while introducing \n{v/c) 
in model ([T]) improves its explanatory power by 1%. All 
these findings imply that market capitalization is a more 
significant infiucncing factor of the tail heaviness. 

A simple manipulation of Eq. ([5]) reads 

a^A + B^/c\n{v) + {Bc-B^/c)\nc, (9) 



which means that the average traded value might also 
have an impact on the tail heaviness. We have investi- 
gated the univariate model by posing Be — B^j^ ~ in 
Eq. ©, that is 

a = A + B\n{v), (10) 

We partition the stocks into 20 groups according to 
their 1-min average traded values. From the g-Gaussian 
model, we find that B = -0.06 ± 0.04 with = 0.12. 
In contrast, the CSN method gives that B = 0.01 ± 0.02 
with i?2 = 0.02 for positive tails and B = 0.02 ± 0.02 
with R^ ~ 0.03 for negative tails. For individual stocks, 
we have B = 0.16 ±0.03 with R^ = 0.03 for positive tails 
and B = 0.17 ± 0.03 with i?^ = 0.04 for negative tails. 
Different from the results of turnover rate and market 
capitalization, the results of average traded value from 
different method are inconsistent with each other. 



VI. CONCLUSION 

We have investigated the distributions of one-minute 
intraday returns of all the A-share stocks traded in the 
Chinese stock market, which is the largest emerging mar- 
ket in the world. The returns are standardized to have 
zero mean and unit variance. We studied the possible im- 
pact of the turnover rate and the market capitalization 
on the return distributions with special attention paid to 
the tail behavior. 

For individual stocks, the returns are found to have 
power-law tails. Basically, the tail exponents fluctuate 
around a = 3, ranging from a = 2 to a = 5. It indicates 
that the return distributions of individual Chinese stocks 
arc well outside the Levy stable regime. When the stocks 
arc grouped according to their turnover rates, market 
capitalization or traded values, the returns in each group 
can be well fitted by the g-Gaussian formula. 

We found from different methods that the tail expo- 
nents logarithmically decrease with the turnover rate and 
increase with the market capitalization and the market 
capitalization has a greater impact than the turnover 
rate. These observations are consistent with the fact that 
a stock is riskier and has fatter tails when its capitaliza- 
tion is small and it has higher turnover rate. However, we 
did not find convincing evidence for the impact of traded 
value on the tail behavior. We conclude that the intraday 
return distributions are not universal in emerging stock 
markets, which is different from the universal tail distri- 
bution for stocks in developed western stock markets. 
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